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ABSTRACT 

An important component of the Extragalactic Distance Database (EDD) at 
littp://edd.ifa.liawaii.edu is a group of catalogs related to the measurement of HI line profile 
parameters. One of these is the All Digital HI catalog which contains an amalgam of information 
from new data and old. The new data results from observations with Arecibo and Parkes telescopes 
and with the Green Bank Telescope (GBT). including continuing input since the award of the NRAO 
Cosmic Flows Large Program. The old data has been collected from archives, wherever available, 
particularly the Cornell University Digital HI Archive, the Nangay Telescope extragalactic HI archive, 
and the Australia Telescope HI archive. The catalog currently contains information on ~ 15,000 
profiles relating to ^ 13, 000 galaxies. The channel flux per channel files, from whatever source, 
are carried through a common pipeline. The derived parameter of greatest interest is Wm5o, the 
profile width at 50% of the mean flux. After appropriate adjustment, the parameter Wmx is derived, 
the linewidth which statistically approximates the peak to peak maximum rotation velocity before 
correction for inclination, 2\^aj;Sini 

Subject headings: astronomical data base; catalogs; galaxies: distances; radio lines: galaxies 



1. INTRODUCTION 

The goal of the overall program facilitated by the Ex- 
tragalactic Distance Database (EDD) is to obtain the 
densest and deepest possible coverage of galaxy distances 
and, hence, of line-of-sight peculiar velocities. We want 
to improve the local determination of the Hubble Con- 
stant and measure departures from the cosmic expansion 
that presumably can be attributed to the distribution of 
matter. We are giving consideration to 7-10 different 
methods for deriving distances. One of these relies on 
the correlation between galaxy luminosities and rotation 
rates (Tully & Fisher 1977). While there are other meth- 
ods that provide more accurate individual measures of 
distance, the luminosity -rotation rate correlation retains 
an importance. The method can be applied to roughly 
half of all spiral galaxies - those that are suitably in- 

*The National Radio Astronomy Observatory is a facility of the 
National Science Foundation, operated under cooperative agree- 
ment by Associated Universities, Inc. 



clined and not confused or disrupted by companions - 
over a wide range of distances. Highly complete samples 
of many thousands of galaxies can be obtained extend- 
ing to 40-100 Mpc, constrained only by manpower and 
competition for access to telescopes. 

The measurement of a distance requires separate ob- 
servations of luminosities and rotation rates. The latter 
can be determined through spectroscopy in either opti- 
cal or radio domains but care must be taken if sources 
are mixed (Courteau 1997; Catinella et al. 2007). Usu- 
ally radio spectra are obtained with beams that encom- 
pass and integrate the flux over an entire target, pro- 
viding a global line profile. Information obtained with 
radio telescopes typically provide higher spectral resolu- 
tion than optical alternatives. Spatial coverage is almost 
always more extensive. To date, optical spectra have 
been obtained for distance measurements over a greater 
range (Dale et al. 1999), although respectable HI spectra 
have been obtained with Arecibo Telescope at redshifts 
as great as 2; 0.2 (Catinella et al. 2008). One wants 
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ultimately to reconcile optical and radio rotation curve 
information. This article focuses on the narrower issue 
of the measurement of rotation rates from radio observa- 
tions of the neutral Hydrogen 21 cm line. The purpose 
of the present discussion is to integrate the considerable 
amount of neutral Hydrogen spectral data obtained by 
ourselves and others through a coherent analysis. 

The study involves the following elements. First, there 
is a review of the status of neutral Hydrogen observations 
as they pertain to our program. Then, we discuss our 
own recent observations with the Arecibo, Green Bank, 
and Parkes telescopes. A reduction procedure has been 
developed to analyze the data. Our observations incre- 
mentally expand on the large body of material available 
in the archives of radio observatories around the world. 
There is a discussion of the unification of all the avail- 
able digital data, analyzed in a common way. The results 
arc provided in tabular and graphical forms in EDD, the 
Extragalactic Distance Database (TuUy et al. 2009). See 
http://edd.ifa.hawaii.edu; select the catalog All Digital 
HI. 

1.1. Historical Background 

Motions within galaxies are a response to the gravi- 
tational potential. If the HI gas is in equilibrium in a 
disk, rotating in circular orbits, then there is a simple 
relationship between the observed motions and the dis- 
tribution of mass. The small dispersion in the relation 
between rotation rate and luminosity implies a strong 
correlation between the dark matter that dominates the 
potential and the baryonic matter that shines. Consider- 
able effort has been made to try to understand this link 
(Bell et al. 2003; Dutton et al. 2007). A focus of recent 
attention has been on the evaluation of how the correla- 
tion changes with look-back time (Conselice et al. 2005; 
Weiner et al. 2006). Our main interest is more mod- 
est: use of the empirical correlation as a way to measure 
distances (TuUy & Pierce 2000; TuUy et al.'2008). 

Neutral Hydrogen is easily detected in nearby spiral 
and irregular galaxies with modern radio telescopes. The 
product of an observation with a single dish facility is a 
line profile which can be grossly characterized by three 
parameters: an integrated fiux, a systemic doppler shift 
from the rest wavelength, and a lincwidth due to inter- 
nal motions. The distances over which galaxies can be 
detected depends on the sensitivity of telescopes and the 
intrinsic gas content of galaxies. 

Regarding telescopes, the bigger the better. Consider 
the situation that is generally close to being met of unre- 
solved sources. Receivers and efliciencies being equal, the 
advantage of a big telescope in exposure time required to 
reach a given signal-to-noise goes as the fourth power of 
the aperture. The signal-to-noise, S/N, achieved in a 
unit time, Iq. depends on the square of the aperture, D. 
of the telescope. To reach a specific signal-to-noise re- 
quires a time t: S/N oc (t/^o)^ oc £>^. Arecibo Telescope 
is presently by far the most sensitive single dish instru- 
ment for HI line studies. Unfortunately it only accesses 
30% of the sky. An additional advantage of a large tele- 
scope is a relatively small primary beam, hence reduced 
source confusion. A small beam only becomes a disad- 
vantage when it is smaller than the dimensions of the 
source, a situation that can result in lost fiux and a bi- 
ased linewidth. 
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Fig. 1. — Top: Fraction of HI mass in logarithmic mass intervals 
derived from the HI mass function of Zwaan et al. (2005). Bottom: 
HI mass per 100 km s^^ of linewidth for a sample drawn from the 
Ursa Major Cluster complete to an HI mass limit of lO'^ Af©. 

Regarding the properties of galaxies, it is instructive to 
consider the HI mass function (Zwaan et al. 2005). The 
cut-off at high mass is abrupt. Systems with higher HI 
mass than 3 x 10"'^'' Mq are rare. It can be supposed that 
larger gas reservoirs than this limit quickly get converted 
into stars. By contrast, many so-called dwarf galaxies 
have abundant HI, refiecting low time-averaged star for- 
mation rates. The result is a pile-up of 90% of HI masses 
in the two decade range 1.5x10^—1.5x10^° Mq. Figure 1 
demonstrates that most of the neutral gas in the z = 
Universe is locked up in galaxies (which are mostly spi- 
rals) with logMni/MQ = 9.2 ± 1.0. In the second panel, 
a histogram is presented of HI mass divided by profile 
linewidth (measured at 20% of peak intensity in imits of 
100 km s^^). The sample is drawn from the Ursa Ma- 
jor Cluster at 17 Mpc and is essentially complete to an 
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HI mass of lO"^ M© (TuUy et al. 1996). Eighty-five per- 
cent of the sources are contained within a single decade 
2x10^-2x10^ Mq per 100 kms'^ Total intrinsic fluxes 
are constrained to a two decade window (left panel of the 
figure) and, since sources with greater fluxes tend to have 
larger linewidths, intrinsic fluxes per spectral channel are 
constrained to a single decade (right panel). The curi- 
ous consequence is that galaxies typed Sb-Sc-Sd can be 
detected in HI with comparable likelihood. A program 
to observe these kinds of galaxies can be expected to 
have a high level of completion within a volume dictated 
by telescope, time available, and motivation. Current 
capabilities can be evaluated by giving consideration to 
Figure 2. Note the sharp upper cut-off in Mhi which is 
rather flat with distance (increased volume). 
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Fig. 2. — HI mass assuming distance = cz km s ^ / 75 km s ^ 
Mpc~^ vs. systemic velocity for 15,000 galaxies in the Lyon Extra- 
galactic Database (LEDA). The solid curve traces the locus of an 
integrated flux of 1 Jy km s^^, a practical limit with the Arecibo 
Telescope. 

It is one thing to detect HI in a galaxy and quite an- 
other to obtain a signal useful for the determination of 
distances. Non-pathological line profiles have character- 
istic features that are helpful. In the case of massive 
galaxies, most of the flux is at the high and low fre- 
quency extremes, originating from gas on the flat part of 
rotation curves. The consequence is profiles with abrupt 
edges and fiux where it is most useful to define these 
edges. For galaxies of sufficiently low mass, those with 
more slowly rising rotation curves, the line profiles lose 
the two-horn shape and instead can be approximately 
gaussian. These systems tend to have less total HI but 
it is piled up over a small range of wavelengths, feeding 
the conspiracy of a common detectability level. 

2. EXISTING HI INFORMATION 

There has been a transformation in the last decade 
in the way data is taken at radio telescopes. In the 
early days, the observer made measurements from ana- 
log displays and the published output were profiles on a 



journal page. The catalog in the Extragalactic Distance 
Database (EDD) called Pre-Digital HI gives a compila- 
tion of information laboriously extracted from literature 
profiles from many dozens of sources (cf. Huchtmeier 
& Richter (1989)). The great concern and attention in 
the compilation of this historical catalog was with pro- 
file linewidths (the fluxes in the catalog are of mixed 
quality and may often suffer from beam dilution effects) . 
Attention should be given to the parameter W20, the 
linewidth at 20% of peak intensity. Often, profiles for a 
given galaxy are available from multiple sources. There 
is a velocity cutoff of 3,000 km for the entries in this 
catalog. A significant contribution came from observa- 
tions by two of the authors (Fisher & TuUy 1981). It is to 
be emphasized that the linewidths given in the Pre Dig- 
ital HI catalog do not come from the literature source; 
they were all derived from published graph profiles by 
Fisher, TuUy, or our colleague Cyrus Hall. 

Some use will be made of this pre-digital information 
in what follows. This carefully accumulated information 
will be used as a basis of comparison with digital data 
processed by machine. A value from the Pre Digital HI 
catalog can even be preferred as a primary source in some 
cases of very large, nearby galaxies that could only be 
satisfactorily observed with the large beams of small tele- 
scopes that are no longer in service. Of course, care must 
be taken to avoid systematics in the translation of profile 
information between different systems. 

The turn of events that marginalizes the Pre Digital 
HI material is the ubiquitous availability of digital spec- 
tral information from radio telescope archives. There is 
no need to go to the telescope to access a rich load of 
data. Of course, the authors of each source will have 
analyzed their material in their own way. At issue for 
us, though, is the need to analyze all profile informa- 
tion from whatever source in a common way. One less 
than satisfactory approach would be to ingest parame- 
ters such as line widths, fiuxes, and systemic velocities 
from literature sources and attempt to reconcile related 
measurements. However more coherent results can be 
expected if a standard analysis procedure is applied to 
the directly observed spectra. 

There are very large and very good compilations of 
HI material already available. The most important are 
assembled in separate catalogs within EDD. Foremost, 
there is the extensive database compiled by the Cor- 
nell group labelled Spring oh /Cornell HI (Springob et al. 
2005) which presents material obtained with the Arecibo 
Telescope, the old 140-foot and 300-foot Green Bank tele- 
scopes, and limited material from the Nangay and Effels- 
berg telescopes. Additionally, EDD contains tables and 
links relating to observations by other observers at in- 
dividual telescopes. The results of a major observing 
program with Nangay Telescope (Theureau et al. 2006) 
are accessed through the catalog HI Nancay. Key re- 
sults from the Parkes Telescope all-southern-sky multi- 
beam project (Koribalski et al. 2004) are reported in the 
catalog HIP ASS 1000. 

Although the material available from archives is ex- 
tensive, it is not sufficient. There are galaxies that 
are important to our program that have not been sat- 
isfactorily observed. Details regarding our samples will 
be described elsewhere but here is a fiavor. One sam- 
ple contains all suitably inclined and unobscured spirals 
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within 3,000 km s ^ brighter than ~ 0.3i* (see the cat- 
alog V3k Mk < -21 in EDD). Another is of spirals 
out to 6,000 km s~^ selected from the Infrared Astro- 
nomical Satellite Point Source-Redshift Catalog (IRAS 
PSC-z; see the catalog Saunders PSCz in EDD). Yet 
another sample consists of galaxies extending out to 
10,000 km s^^ with observed supcrnovac of Type la, in- 
cluded for the purpose of improving the zero-point cali- 
bration of the supernova scale. 

For the present discussion, the point to be made is that 
we needed to supplement the archival material with our 
own observations, which required that wc develop proce- 
dures for the analysis of our observations. Wc might have 
simply adopted a pre-existing procedure, say, incorporat- 
ing one of the linewidth measures described by Springob 
et al. (2005) . However, we decided we would learn more 
if wc developed our own algorithm, albeit one inspired 
by an approach considered by Springob et al. The ex- 
ercise would provide independently derived results that 
could, in many cases, be compared with other sources to 
evaluate uncertainties. 

The discussion continues in the next section with de- 
scriptions of our observations with the Arecibo, Green 
Bank, and Parkes telescopes. In the ensuing four sec- 
tions there are descriptions of how we measure profile 
parameters, evaluations of product quality, and a brief 
discussion of adjustments to give a parameter of dynam- 
ical interest. In the section before the summary, there is 
a description of the catalog in EDD called All Digital HI 
which contains both the results of our own observations 
and the reprocessed results of data from the archives, all 
treated with the same procedures. 

3. OBSERVATIONS WITH THE ARECIBO, GREEN BANK, 
AND PARKES TELESCOPES 

Our program began once the Arecibo Telescope was 
brought back in service after installation of the Gre- 
gorian feed and ground screen. Since the start of the 
multibeam sky survey Arecibo Legacy Fast ALFA (AL- 
FALFA), we have discontinued our own observations with 
Arecibo Telescope, with the expectation that many of 
our sources in the survey range 0° < 5 < 36° will be 
observed serendipitously with sufficient accuracy. Sub- 
sequently, we have been observing with the 100m Green 
Bank Telescope at declinations above S = —45° but ex- 
cluding the Arecibo range. As of the third semester of 
2008 this program has been awarded the status of a Large 
Program, now christened with the name Cosmic Flows^. 
Results from this program are appearing in the All Dig- 
ital HI catalog as they become available. Access to the 
remaining sky, at i5 < —45°, requires use of Parkes Tele- 
scope in Australia. Observations with this facility began 
early in 2009. 

3.1. Arecibo 

The single-beam Arecibo observations of 330 galaxies 
were undertaken in two sessions, October 1999 and April 
2001, mostly between sunset and sunrise to avoid spec- 
tral baseline distortion from solar continuum emission. 
The 1999 session used the "L-Narrow" receiver for ob- 
jects with known redshifts and the "L-Wide" receiver for 
unknown redshifts that required a search over a wide ve- 

^ http://www.vla.nrao.edu/astro/prop/laj:geprop/ 



locity range. The 2001 session used only the "L-Wide" 
receiver with improved system noise temperature. 

Eight correlator sections, each with 2048 spectral chan- 
nels, were available. The unknown redshift search used 

4 overlapping, 25-MHz bandwidth spectra for a velocity 
span from -400 to -1-18,000 km s~^, heliocentric, in each 
of two linear polarizations. The known redshift spec- 
tra were taken with 4 correlator sections, 2 polarizations 
with 12.5 MHz bandwidth and 2 with 6.25 MHz band- 
width, all centered on the galaxy line profile. All spectral 
data were Hanning smoothed to produce resolutions of 
1.29, 2.58, and 5.16 km s^^ at zero redshift with 6.25, 
12.5, and 25 MHz bandwidths, respectively. Narrowband 
RFI was edited manually and data values replaced with 
a linear interpolation. For most spectra a second-order 
baseline was least-squares fit to the data on either side 
of the line profile and subtracted from all spectral data 
values. In no case was the baseline curve higher that 
third order. 

The basic Arecibo observation was 7 minutes on the ob- 
ject and 7 minutes on a blank-sky position on the same 
hour angle track. When time was available weak line pro- 
files were observed on more than one day and the spec- 
tra were averaged. The line profile fiux density scale was 
established with correlator observations of known con- 
tinuum sources over the full scan range of the Arecibo 
telescope to determine gain as a function of zenith dis- 
tance. 

3.2. Green Bank 

The single-beam Robert C. Byrd Green Bank Tele- 
scope (GBT) observations were carried out in the course 
of two programs, one program from 2001 to 2002 and 
one large project beginning February 2006 and continu- 
ing through May, 2009. The observations were made day 
and night. Roughly 1000 galaxies have been observed 
between these programs. 

The earlier observations were conducted during com- 
missioning of the GBT in the fall and winter of 2001/2 
as a background 21-cm observing program during times 
when the telescope was not occupied with tests or cali- 
bration. Simple on-off spectral line measurements were 
made to acquire global HI profiles of galaxies at redshifts 
out to about 10,000 km s~^. Integration times were be- 
tween 10 and 60 minutes and typical bandwidths were 

5 or 10 MHz depending on the expected signal strength 
and line profile width. All observations used the FFT 
spectrometer which has 1024 channels for each of the 
two linear receiver polarizations. The system tempera- 
ture was slightly under 20 K at high elevations and the 
100-meter aperture efficiency was roughly 70%. Intensity 
calibration of the HI survey of galaxies during commis- 
sioning of the Green Bank Telescope (GBT) is tied to 
the NVSS fiux density scale (Condon et al. 1998). About 
five dozen continuum sources with fiux densities between 
2.2 and 6.0 Jy were selected to avoid significant multi- 
source confusion with the nine-arcminute GBT beam. 
The continuum calibrators were observed with the same 
spectrometer and rec;eiver configuration as was used to 
measure the HI line profiles in the survey, with the ex- 
ception that the spectrometer bandwidth was always 40 
MHz centered on 1403 MHz to span most of the range of 
frequencies observed with smaller bandwidths. 

The calibrator observing sequence was 2 minutes off. 
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2 minutes on, and 2 minutes off source in spectral line 
mode. The first off position was 38 arcminutes toward 
lower right ascension than the source position, and the 
second off was the same distance toward greater right 
ascension. The hour angle track was, therefore, not ex- 
actly the same for the three observed positions for high 
declination objects, but this did not appear to degrade 
the spectral baselines significantly. The first task in the 
calibrator data reduction was to visually inspect the dif- 
ference spectrum between the two off positions. The 
difference spectrum offset was typically less than about 
60 mjy, as is expected from confusion noise with the 
GBT beam size, but a few offsets were as high as 300 
mJy. These large offsets were possibly due to a moder- 
ately strong source in one of the off positions or, more 
Ukely, a bit of radiation from the Sun during the day. 
Since the observed source flux density was about 3 Jy, 
even the largest off-source baseline offset, after the two 
off spectra were averaged together, caused about 5% er- 
ror in the measure source intensity. More typically, this 
source of error amounted to less than 1%. The statistics 
of the calibrator source measurements were not signifi- 
cantly improved by throwing out observations with larger 
off-position differences so all data were retained. 

For the observations since 2006 use is made of the single 
beam L band (1 to 2 GHz) receiver and the spectral line 
spectrometer as the backcnd detector. Data are taken 
with a 12.5 MHz bandwidth and 9-level sampling. To- 
tal power observations arc made with a full calibration 
noise source switching cycle of 1 second. The spectrom- 
eter records data every 30 seconds. The spectral line is 
Doppler tracked in the barycentric velocity frame. Data 
are taken using linear polarizations. Redshifts from the 
Lyon Extragalactic Database (LEDA) or NASA Extra- 
galactic Database (NED) were used to center the win- 
dow. 

The basic GBT observation procedure was to take a 
pair of on-off observations with 300s on and 300s off the 
target. Galaxies within 3,000 km s^^ usually required 
1 to 3 scan pairs, while galaxies from the PSCz sam- 
ple reaching up to 8,000 km s~^ required 10 to 15 scan 
pairs. A preliminary guess on how much observing time 
a target would need was derived from the 21 cm magni- 
tude given in LEDA. In order to optimize the observing 
time, a target was observed with time split over several 
days. Data were reduced daily and evaluated in order 
to add observing time as needed until the signal reached 
the desired very high quality for a luminosity linewidth 
distance measurement. 

Individual observations are calibrated in Jy using the 
standard calibration procedure available at the GBT. 
GBTIDL provides basic routines that can be used to cal- 
ibrate and average spectra when the data are taken in 
standard, predefined observing modes. The calibration 
routines typically give a flux scale accurate to 10%. Well- 
known galaxies that can be used as HI calibrators were 
also observed several times per month in order to be able 
to retrieve a more accurate flux calibration if needed in 
the future. 

The calibrated data are then averaged, baseline sub- 
tracted using a polynomial flt usually of order 3, and 
smoothed with a simple Hanning filter. The final spec- 
trum is stored with 1.6 km s~^ resolution. It was usually 
binned at least once to 3.2 km s~^ resolution for the HI 



linewidth measurement. 

3.3. Parkes 

Observations with Parkes Telescope make use of the 7- 

beam system in MX mode, with the target in the central 
beam and the 6 outer beams used to monitor the sky. 
Integration times are estimated based on fiuxes obtained 
with HIPASS integrations of roughly 8 min per pointing, 
velocity resolution of 18 km s~^, and r.m.s. sensitivity of 
13 mJy per channel. For linewidths adequate for our pur- 
poses we attempt to obtain spectra with peak signal to 
noise of at least 10 with spectral resolution of 2 km s~^. 
Galaxies at 3000-4000 km s^^, which are the most com- 
mon of our targets, typically require 60 min on source. 
Profiles are evaluated at 30 min exposure and, if inade- 
quate, the source is reobserved and profiles are summed. 
Daytime observations are avoided to minimize degrada- 
tion by the Sun. The Parkes multibeam data are reduced 
using the graphical user interfaces Livedata, Gridzilla, 
and MIRIAD. The data is calibrated in Janskys using 
the procedure described by Barnes et al. (2001). 

At press, a first observing run during January- 
February 2009 has been completed culminating in ob- 
servations of 58 galaxies. Results from this run and the 
much larger archival sample of 1000 brightest sources 
from the HIPASS program (Koribalski et al. 2004) are 
analyzed and included in EDD. 

4. PROFILE LINEWIDTHS 

The linewidth measure given in the Pre- Digital HI cat- 
alog of EDD and used by us since the early paper by 
Tully & Fisher (1977) is W20, the linewidth at 20% of 
peak intensity. This is an appropriate moment to eval- 
uate whether that parameter choice is optimal since we 
now do the analysis on digital data with a rigorous al- 
gorithm and apply the same procedure to all available 
material. The thinking behind the original c;lioice was 
that a linewidth at a very low level of intensity with 
respect to the maximum is desirable to minimize depen- 
dencies on vagaries in the distribution of flux within the 
profile. The opposing constraint is the need to be above 
the noise level. With profiles deemed adequate, it was 
empirically determined that the 20% of peak intensity 
level is sufficiently out of the noise. Adequate profiles 
are characterized by peak signals at least seven times 
greater than the noise. 

As we consider alternatives, we look to the study by 
Springob et al. (2005) [SHGK]. Their data are made 
available at the Cornell University Digital HI Archive 
website^. Those authors have given attention to a 
large body of high quality data from their own observa- 
tions and from the archives. They derived five separate 
linewidth parameters with automated algorithms. These 
distinct linewidth measures can be compared with each 
other and, for most of the galaxies within 3,000 km s~^, 
with the W20 values from the Pre-Digital HI catalog in 
EDD. Comparisons are shown in Figure 3 for 1110 galax- 
ies considered to have good W20 measures. In 3% of cases 
the linewidths are discordant by more than 50 km s~^. 
These large differences are due to cataloging errors or 
gross errors due to noise. Automated procedures are vul- 
nerable to occasional gross errors - as manifested by big 

^ http://arecibo.tc.cornell.edu/hiaJchive 
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differences between the five SHGK parameters in a small 
fraction of cases. In tfie following discussion, we clip in- 
stances with deviations greater than 50 km s~^ from the 
mean of W20 — Wx , where X is one of the five linewidths 
given by SHGK. 
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Fig. 3. — Comparison between the W20 linewidth parameter 
from the 'Pre-Digital HP catalog in EDD with two of the linewidth 
parameters given by SHGK, Wpso a-^d Wmso- The comparison is 
based on 1110 cases with good W20 measures. 

It turns out that we find a clear preference for one of 
the SHGK linewidth parameters. It is not the parame- 
ter advocated as optimal by SHGK. Those authors prefer 
the parameter Wf5o, the width at 50% of the peak mi- 
nus rms flux with left and right edges evaluated indepen- 
dently with polynomial fits to the rising portions of the 
profile. In Table 1 we collect comparisons between our 
old W20 values and the five SHGK values (plus their pa- 
rameter Wc which is Wf5o with redshift, instrumental, 
and smoothing corrections). Of course, zero-point offsets 
are expected. The figure-of-merit is the r.m.s. disper- 
sion. The SHGK parameter Wm50 gives a significantly 
better correlation with our W2q. Conveniently, it also 
gives rather close agreement in zero-point. Figure 4 gives 
a detailed comparison between W20 and the two SHGK 



TABLE 1 
Differences between Pre Digital 
AND SHGK Linewidth Measures 
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linewidth parameters of greatest interest. In the case of 
the parameter Wpso the mean difference with respect to 
W20 is displaced from the peak of the distribution (the 
distribution is skewed) and (after clipping values more 
deviant than ±50 km s~^ from the mean) the dispersion 
is a rather substantial 17 km s"'^. In the case of the 
parameter Wm50 the histogram is symmetric and (after 
clipping values more deviant than ±50 km from the 
new mean) the dispersion is a reasonable 10 km s~^. 



n 



AW (x=M) r 

±10 



AW (X=F) 
= 25 ±17 




Fig. 4. — Histograms of the differences in linewidth value W20 
given in the 'Pre-Digital HP catalog of EDD and the linewidth 
values Wxso given by SHGK where X = F (dotted histogram in 
red) is the parameter preferred by those authors, a measure at 50% 
of peak flux, and X = M (solid histogram in black) is a measure 
at 50% of the mean flux. The histogram of W20 — Wpt^Q values 
is skewed with a tail to positive differences and a relatively large 
dispersion. The histogram of 1^20 ~ values is symmetric and 

has lower dispersion. 

The parameter VFmso is the linewidth at 50% of the 
mean flux level within the HI signal. This construct has 
nice features. Using the mean flux level rather than the 
peaks serves to disengage the linewidth measure from 
details of the gas distribution. It gives a more natural 
transform over to single peak cases. The measurement is 
at a low level compared with the peak, statistically only 
slightly above the level that gives W20 ( hence the similar 
zero-point). The main operational challenge is to define 
the window containing the signal. SHGK reserve most 
of their discussion for their linewidth parameter Wpbo 
and do not give details of the derivation of Wm5q- We 
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have developed an algorithm which might differ in minor 
details. An empirical test of that possibility will come 
from inter comparisons of our separate results with the 
same data, reported in the next section. To distinguish 
our parameter from that developed by SHGK, we refer 
to our measurement as Wm5o, that is, with a lower case 
'm'. 

The most sensitive detail with the derivation of Wm5o 
is the specification of the wavelength window for the sum- 
mation of flux. The total flux detected within the single 
beam pointing is relatively well defined. However the 
mean flux per channel can be significantly less well de- 
fined if channels in the wings of the profile are included 
or not because of noise. 

Our specific recipe is to first determine the integrated 
flux within a window that is tight, yet sufficiently wide 
that the profile has reached the baseline level (call this 
the fOO% window). Then the wavelengths are deter- 
mined that exclude 5% of the integrated fiux on each 
of the two wings (these enclose the 90% window). The 
mean flux per channel is then taken to be the sum of 
the flux within this 90% window divided by the number 
of spectral channels. Both the numerator and denomi- 
nator in this calculation require interpolations since the 
wavelengths defining the window are not restricted by 
the discreteness of the spectral channels. 

The linewidth is defined at the level of 50% of the 
mean fiux per channel determined in the manner that 
has just been described. The intersection points at the 
two edges are defined by interpolations between observed 
flux-velocity points on the rising parts of the proflle. 
SHGK use a more elaborate fitting scheme to define edges 
but it is not clear that the effort results in greater pre- 
cision. In clean cases the spectra rise abruptly and in 
ambiguous cases the uncertainty is not necessarily ame- 
liorated by a particular fitting recipe. 

The assignment of errors is a particularly challenging 
problem. Profiles can be messy in so many ways that 
we despaired of finding an algorithm that gives sensible 
results in all cases. Our overwhelming interest is the 
use of profile widths as a parameter in the measurement 
of distances. We consider that there is a threshold of 
acceptability; a profile may be of sufficient quality to 
be used in the determination of a distance, or it may 
not be. We link our error estimate to this threshold. 
Specifically, an adequate profile is assigned an error of 
less than or equal to 20 km s"^. Inadeqiiate profiles 
are identified by errors greater than 20 km s^^. It has 
been found (TuUy & Picrcjc 2000) that with linewidth 
errors constrained to this limit in quality the measure 
of linewidths is not a dominant source of errors in the 
determination of distances. 

The detailed error assignment is arrived at in two steps. 
The first step is automatically generated based on the 
signal (the mean fiux per channel within the 90% win- 
dow), S, to r.m.s. noise, N. Errors were evaluated from 
a training set. Errors of 8 km s~^ are assigned in the best 
cases, whenever the mean flux per channel is greater than 
17 times noise. Errors degrade to 20 km s^^ by a mean 
flux per channel to noise of 2, and continue to increase 
as signal degrades further. Specifically, the error ew is 
assigned based on signal-to-noise, S/N, following: 

ew = Skm s"^ if S/N > 17 

ew = 21.6-0.8 S/N km s-'^ i{2 < S/N < 17 



TABLE 2 

Telescopes Contributing to the Database 



Telescope 


Aperture 


Beam 


Acronym 


Arecibo 


305m 


3' 


AOG-AOlf-ALFA 


Nangay 


200x40m 


4' X 22' 


nan 


GBT 


100m 


9' 


GBT 


EfEclsberg 


100m 


9' 


Effs-Eff 


GB300 


91m 


10' 


GB300 


Parkes 


64m 


14' 


PARS 


GB140 


43m 


21' 


GB140 



TABLE 3 
All Digital HI Catalog Sources 



Code Literature Source 

ksk2004 Koribalski, Staveley-Smith, Kilborn, et al. 2004 

shg2005 Springob, Haynes, Giovanelli, Kent 2005 

hkk2005 Huchtmeier, Kajrachentsev, Karachentseva, et al. 2005 

tmc2006 Theureau, Martin, Cognard, et al. 2006 

ghk2007 Giovanelli, Haynes, Kent, et al. 2007 

sgh2008 Saintonge, Giovanelli, Haynes, ct al. 2008 

kgh2008 Kent, Giovanelli, Haynes, et al. 2008 

ctf2009 Courtois, TuUy, Fisher, et al. 2009 (this paper) 



e^y = 70 - 25 S/N km s'^ if S/N < 2 

Errors are not allowed to be less than the spectral reso- 
lution after smoothing. Our error estimates are conser- 
vative; roughly a factor 2 larger than values found in the 
literature associated with the same observations. Cross- 
correlations between data sets to be discussed later will 
demonstrate that our error estimates must on average be 
^ 50% greater than a Icr value. 

The second step involves a manual inspection, given 
to every profile. The fit and error estimate proposed by 
the computer algorithm is displayed on a monitor. The 
operator can accept or modify. It may be necessary to 
excise interference, or reposition the edges of the 100% 
window (the most common occurrence), or smooth. The 
computer redisplays. The final decision by the opera- 
tor is whether the error is appropriate, which is a binary 
decision to the question: is the profile adequate or inade- 
quate for the purpose of measuring a distance? The error 
only needs to be changed one way or the other across the 
20 km s~^ acceptance threshold if the carbon-based de- 
cision contradicts the silicon-based assignment. 

4.1. Good, Bad, and Ugly 

Examples of good profiles are seen in Figure 5. They 
are characterized by high flux 'horns' at the extrema of 
the profiles, which arise from emission from the fiat por- 
tion of rotation curves near maximum velocity. The pro- 
files rise rapidly, leaving little uncertainty in the mea- 
surement of the widths. There is apparent splitting into 
two peaks even with the narrowest profile chosen as an 
example in this figure although typically the distinctness 
of two peaks is lost in narrow profiles. In this figure and 
others to follow, information regarding the source of the 
profile is encoded in the header above the profile. One 
is given the PGC number, a common name, the archival 
or new observation source in a code given in Table 3 and 
the telescope as identified in Table 2. 

The catalog All Digital HI now contains information 
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sured profiles for apparently single targets, that for PGC 
71392 (UGC 12591) is by far the widest, with W^so = 
989 km s~^. The line profile and an image of the galaxy 
(typed SO/Sa) are shown in Figure 6. Giovanelli et al. 
(1986) have drawn attention to this unusual galaxy. That 
reference notes that there is probably absorption from a 
central continuum source affecting the spectrum. 
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Fig. 5.— PGC 3671 (NGC 337A), PGC 6061 (UGC 1167), 
PGC 29096 (ESQ 316-018), and PGC 26512 (NGC 2841), profiles 
with increasing linewidths, observed with the NRAO 300', Arecibo, 
Byrd Green Bank, and NRAO 140' telescopes respectively. 

on approximately 13,000 galaxies. Of reliably mea- 




FlG. 6.— PGC 71392 (UGC 12591), the galaxy with the largest 
linewidth with WmSO = 989 km s"! at Vn = 6967 km s'^. The 
velocity scale for this exceptionally wide profile is expanded. Ob- 
served with Arecibo Telescope. 



At the other extreme, PGC 10314 (NGC 1058) is the 
most anorexic of galaxies in the current database. The 
profile is shown in Figure 7. This galaxy has been identi- 
fied by Lewis (1975) and van der Kruit & Shostak (1984) 
as an example of a galaxy seen almost face on. 

There are lots of bad spectra among the 15,000 pro- 
files in All Digital HI. The two main reasons for bad 
spectra are poor signal-to-noise and confusion from mul- 
tiple sources in the radio beam. It is easy to find exam- 
ples of poor spectra; select cases with error assignments 
ew > 20 km s^^. We adopt the convention of assigning 
ew — 100 km s^^ in cases of confusion, and ew = 500 
km s~^ in cases of null detections, although we have not 
been consistent. Our fundamental convention is, if the 
profile is inadequate to the task of measuring a distance, 
then an error greater than 20 km s~^ is assigned. The ex- 
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Fig. 7.— PGC 10314 (NGC 1058), the galaxy with the narrowest 



Uncwidth with 



: 33 km s-l. Observed with GBT. 



act value of an error assignment greater than 20 km s~^ 
has httle rigor. 

While it is not worthwhile to dwell on the bad, it is 
instructive to consider a few examples of the ugly. A 
cautionary example is illustrated in Figure 8. The two 
profiles were obtained with Arecibo Telescope and the 
Green Bank 140-foot Telescope with, respectively, half- 
power beams of 3' and 21'. The galaxy NGC 7814 has 
a diameter at the B band isophot of 25 mag as~^ of 
5.5', larger than the Arecibo beam but much smaller 
than the 140-foot beam. Flux is lost in a single beam 
pointing with Arecibo Telescope but not with a pointing 
involving the smaller telescope. The lost flux from the 
extremities of the galaxy with the Arecibo observation 
cause a pronounced reduction in the 'horns' and affects 
the measurement of the profile width. 

In Figure 9 one sees what seems to be a normal edge- 
on spiral but one horn is very pronounced and the other 
is almost unseen. A profile like this creates a problem if 
the measurement of the width is referenced to the peak 
fiux. It creates somewhat less problem with our deriva- 
tion based on the mean fiux. 

The situation is even more extreme with the galaxy 
shown in Figure 10. In this case the distribution of neu- 
tral hydrogen in the Arecibo profile is so asymmetric that 
the long wavelength edge of the profile is poorly defined. 



Fig. 8.— PGC 218 (NGC 7814). Top profile: The 3' half-power 
beam of Arecibo Telescope is smaller than the target galaxy causing 
flux to bo lost and affecting the profile shape. Bottom profile: All 
the flux emitted by the galaxy is captured within the beam of the 
Green Bank 140-foot Telescope. 



Given the size of the galaxy, the Arecibo observation 
may be missing fiux. The profile obtained with Parkes 
Telescope is asymmetric in the same sense though less 
extreme. The 14' primary beam of the Parkes Telescope 
is considerably bigger than the source so the asymmetry 
is undoubtedly a real feature of this galaxy. In this case, 
the Parkes profile is preferred over the Arecibo profile. 

In the case shown in Figure 11 the profile has gone 
beyond ugly to bad. There is a substantial wing on the 
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Fig. 9.— PGC 6759 (NGC 684). One HI peak is much more 
pronounced than the other. This profile was obtained with Arecibo 
Telescope. 

long wavelength side. The linewidth is acutely sensitive 
to the choice of level of measurement. The galaxy looks 
distorted. A galaxy interaction is suspected. 

With IC 2511 seen in Figure 12 the profile is merely 
ugly. There is a wing on the short wavelength side. The 
linewidth is sensitive to the choice of measurement flux 
level, but not sufliciently to cause us to reject the profile 
by assigning an error greater than 20 km s""'^. There is 
no evidence of an abnormality in the image of the galaxy. 
It can be appreciated that there is a continuum of situa- 
tions between those shown in Figs. 11 and 12, commonly 
aggravated by much worse confusion from noise. In the 
final analysis, profiles have been accepted or rejected (as- 
signed errors less or greater than 20 km s~^) on the basis 
of visual inspection. 

An example of contamination from multiple sources 
in the radio beam is provided by Figure 13. Both the 
galaxy at the center of the image and the fainter object 
4.7' northeast have been observed with separate point- 
ings with the Arecibo Telescope. The half-power beam 
diameter with that telescope is 3 arcmin. One of the 
sources is cleanly detected with a peak flux four times 
greater than the other. Lo and behold, it is the smaller 
fainter galaxy to the northeast. The visibly dominant 
galaxy is detected in HI but the profile is messy and 
probably contaminated by fiux from the companion. We 
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Fig. 10.— PGC 4063 (UGC 711). Profile on top obtained with 
Arecibo Telescope is extremely asymmetric and is suspected to be 
biased because of beam attenuation. Profile on the bottom was 
obtained with the 5 times larger beam of the Parkes Telescope. 
The asymmetry of the profile is not just a resolution effect. 

assign an error of 100 km s^-'^ (confused) to the linewidth 
of the brighter galaxy. 
Another all-too-common situation is illustrated in Fig- 
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Fig. 11.— PGC 2081 (NGC 157). Wing on right edge of profile. 
The galaxy is somewhat distorted. The observation was made with 
the Green Bank 140' telescope. 

lire 14. The profile is anomalous, with a pronounced peak 
and shoulders. In the image, a second galaxy of unknown 
velocity is seen that would lie near the half-power level 
of the Arecibo Telescope beam. A third galaxy, fainter 
and more distant, could conceivably contribute to the 
confusion. 

These examples give a reminder of the advantages of 
observations with different facilities. The relevant tele- 
scopes and the single beam half power field diameters are 
listed in Table 2. Nangay Telescope has an unusual beam 
shape that provides good resolution east-west but poor 
resolution north-south. Arecibo Telescope provides the 
best resolution but attention must be given to possible 
loss of flux with large targets. 

The examples given attention in this section provide 
the warning that there are lots of unacceptable line pro- 
files for our purposes. Even among those identified as 
acceptable by the error estimate, comparisons when al- 
ternative observations are available reveal that 2-3% are 
bad. Still, among 15,000 profiles there are an abundance 
of good data. In the next section there is an evaluation 
of how good is good. 

5. EVALUATION OF THE NEW Wmso PARAMETER 

Comparisons between alternative linewidth parameters 
are illustrated in Figures 15 and 16. In each case, mea- 



FlG. 12.— PGC 28246 (IC 2511). Messy wing on left side of 
profile obtained with GBT. 

sures with uncertainties greater than 20 km s^^ are re- 
jected so the comparisons are between data that are sup- 
posed to be good. In Fig. 15, the comparison is between 
our Wm50 parameter reported in the All Digital HI cat- 
alog and the W20 parameter in Pre Digital HI. In the 
top left panel, the Wmso values are derived exclusively 
from data extracted from the Cornell HI archive. Of 11 
cases with linewidth measures that deviate by more than 
50 km s~^ from the mean in a sample of 1107 galaxies, 
five can be traced to confused profiles caused by near 
neighbors. The mean difference of < W20 — W^mso >= 
15 km s-i with the 1096 remaining galaxies is expected 
since W20 is measured at a fainter level. The r.m.s. scat- 
ter after elimination of the 11 most deviant cases is a 
reasonable 11 km s~^. In the top right panel, there is a 
slight augmentation of the sample through the inclusion 
of new data acquired by the authors. By plotting the 
difference in linewidth measures on the ordinate there is 
sufficient scale resolution to detect a weak dependency 
in the difference in linewidths with the amplitude of ro- 
tation: W20 - Wm5a = 17.7 - 0.012W™5o. In the lower 
panel, it is seen that the histogram of the differences 
W20 — Wm50 is symmetric about the mean. The details 
of this comparison are reported at the bottom of Table 1. 

The comparison in Fig. 16 involves two separate algo- 
rithms to determine the linewidth at 50% of mean flux. 
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Fig. 13.— PGC 68870 (NGC 7280) and PGC 68878 (UGCA 
429) separated by 4.7'. Arecibo observations of each resolves the 
separate sources. UGCA 429 at upper left has the stronger, un- 
ambiguous signal. NGC 7280 in the center of the image is clearly 
detected but flux in the right wing must be associated with UGCA 
429. 

There is the parameter given by SHGK referred to as 
Wm5o and the variant determined by our procedure re- 
ported in All Digital HI called Wm5Q- As in the previous 
figure, the data displayed in the top left panel draws 
exclusively from the Cornell HI archive while the data 
used to generate the right panel is slightly augmented 
by new observations. The results are substantially the 



same. There is an offset of 6.6 km between the 
SHGK parameter Wm50 and our Wmso- The SHGK pa- 
rameter is evaluated at a slightly lower flux, presumably 
because they evaluate the mean flux over a wider wave- 
length window than our 90% window. There is a small 
but significant dependence of the offset on rotation rate: 
Wm5o - Wmbo = 10.8 - 0.015Wm5o. Again, 1% of cases 
are deviant by greater than 50 km s~^, usually because of 
confusion caused by a companion. Those aside, the r.m.s. 
difference from the mean is a satisfactory 9 km s~^ . How- 
ever as seen in the lower panel, the distribution is slightly 
skewed, with a tail to positive differences. 

Besides the Cornell HI archive, another important 
source of HI profile information is the LED A database 
(Paturel et al. 2003) and the related HI archive'^ associ- 
ated with the 'Kinematics of the Local Universe' (KLUN) 
project (Theureau et al. 2006). Derivative parameters 
and a discussion of results are presented by Theureau 
et al. (2007). Access to the KLUN tabular material and 
profiles is provided in EDD through the HI Nancay cat- 
alog. 

A comparison with the KLUN results is not straight 
forward because of mixed use of the optical and radio 

^ http://klun.obs-nancay.fr 
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Fig. 15. — Top: Comparison between the linewidth at 50% of 
mean flux in the catalog All Digital HI and the linewidth at 20% 
of peak intensity in the catalog Pre Digital HI. Middle: Linewidth 
difference as a function of Wm50 with a least squares fit superposed. 
Bottom: Histogram of differences W20 — WmSO- 
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Fig. 16. — Top: Comparison between two alternative estimators 
of the linewidth at 50% of mean flux, WmSO from the All Digital HI 
catalog and Wmtm from the Spring ob/ Cornell If/ catalog. Middle: 
Linewidth differential plot with least squares fit as a function of 
Wm50- Bottom: Histogram of differences Wm5Q — WmhO- The 
distribution is skewed, with a median difference of 5 km s~^. 
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conventions for transforming doppler shifts to velocities. 
In the optical convention, which we use, one considers the 
shift in wavelength with respect to the rest value, Vopt = 
c(A — Ao)/Ao, while in the radio convention one consid- 
ers the shift in frequency, Vrad = c^vq — v)/vq. Profiles 
displayed in the Nangay database are presented in the ra- 
dio convention although it is to be noted that the same 
profiles made available through NED, the NASA/IPAC 
Extragalactic Database^, have been converted to the 
optical convention. The tabular information presented 
by Theureau et al. (2007) is mixed. Systemic veloci- 
ties have been transformed to the optical convention but 
linewidths appear to have remained in the radio con- 
vention. For a galaxy at 7000 km s"'^ with a linewidth 
~ 400 km s~^ the linewidth in the optical convention 
is ~ 20 km s^^ wider than in the radio so the issue is 
significant. 

We have made comparisons between KLUN linewidths 
at 20% of peak intensity, adjusted to the optical con- 
vention, and Wm5o linewidth values drawn from the All 
Digital HI catalog. The comparison accepts only cases 
from the All Digital HI catalog with linewidth errors 
< 20 km s~^ and cases from the HI Nancay catalog with 
a linewidth quality index assigned by us of 1-3. See Fig- 
ure 17. After rejection of 4 extreme outliers, the scatter 
is 13 km s""'^. There is a hint of a correlation with Wmso 
as found in Figs. 15 and 16. The comparison sample is 
restricted because for the moment only a small fraction 
of galaxies in the HI Nancay catalog have been assigned 
a quality index by us. If galaxies without a quality index 
are accepted then the comparison can be based on an or- 
der of magnitude larger sample of 992 galaxies. However 
the scatter is then much worse. Even after clipping 48 
cases with deviations greater than 80 km s~^ from the 
mean the scatter is a poor 22 km s~^. There will be a 
discussion in the next section of the results of our own 
analysis of profiles extracted from the Nangay database. 
It will be seen that when the data are treated in a uniform 
way there is good agreement between all sources, whether 
drawn from the Nangay or Cornell databases or derived 
from our new observations with GBT and Arecibo. 

The weak but significant tilt in the difference plots as a 
function of rotation rate seen in Figs. 15 - 17 is a conse- 
quence of a different linewidth definition. Our linewidths 
are measured at relatively higher flux levels for small 
galaxies and approach the levels of the other measures for 
large galaxies. A result could be a slightly flatter slope for 
the luminosity-line width correlation. These small differ- 
ences serve to emphasize that the linewidth measures are 
empirical constructs. We are reminded that the details 
of the construct may not be important but consistency 
is required if biases are to be avoided. 

6. COMPARISON OF NEW AND OLD DATA 

The comparisons in the previous section are between 
different measures of linewidth with largely the same 
data. In this section there is a comparison of new and 
old observations of identical targets, all analyzed with 
the procedure discussed in this paper. A point of detail: 
the linewidths discussed in this section have received the 
small correction for spectral resolution discussed in the 
next section to facilitate the comparison of observations 

* nedwww.ipac.caltech.edu/forms/SearchSpectra.html 
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Fig. 17. — Difference between the Nangay/KLUN linewidth at 
20% of peak intensity and the Wm50 linewidth parameter in the All 
Digital HI catalog plotted against the WmSO parameter. A least 
squares fit is superimposed. The slope has only a 2.2cr significance. 

made with different telescopes and receivers. Figure 18 
shows the current status of comparisons between new and 
Cornell archival profiles, giving separate consideration to 
new GBT and Arecibo observations. The archival pro- 
files come from a variety of telescopes, never GBT. In all 
cases, the linewidth measure is our Wm5o parameter. 

The r.m.s. scatter between linewidths drawn from ma- 
terial out of the Cornell HI archive and linewidths deter- 
mined from new observations is a satisfactory 10 km s~^. 
If errors are partitioned equally, the implied uncertainty 
is 7 km s~^ in each of the new and old measures. The 
scatter is similar in the separate GBT and Arecibo com- 
parisons. 

There is a minor mystery in the zero-point offsets seen 
in both samples displayed in Fig. 18. The linewidths 
measured from the archival profiles are 3 km s~^ wider 
in the mean, a difference with 3cr significance. It may be a 
factor that the new profiles have higher signal-to-noise in 
the great majority of cases. This is not a negative reflec- 
tion on the archival material; simply a consequence of our 
strategy of primarily re-observing objects with poor pro- 
files (though recalling that profiles ascribed errors larger 
than 20 km are rejected in all comparison samples). 
With the 258 galaxies represented in Fig. 18, the mean 
difference in the error assigned to the linewidth, archive 
minus new, is 6 km s~^. An explanation for the zero- 
point offset might be that linewidths are slightly overes- 
timated with noisier profiles. 

Another large component of the All Digital 7// catalog 
is built from data extracted from the Nangay database. 
To be clear, the profile fits and derivative parameters for 
observations from the Nangay telescope given through 
the All Digital 777 catalog are based on the analysis pro- 
cedures described in this paper; results from the origi- 
nal source of the data are found in the catalog HI Nan- 
cay. This is analogous to the distinction in the case of 
the material from Cornell, with original source material 
in catalog Springob/ Cornell HI and re-analyzed material 
in catalog All Digital HI. There are 720 galaxies in All 
Digital HI with both a satisfactory {ew < 20 km s~^) 
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compare 126 galaxies: GBT HI t. Cornell HI 
<W.„ - ir.-„> = a.l +-0.9 r.m.s. +-3.6 
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Fic;. 18.— Comparison between our new data and data from the 
Cornell archive, both analyzed with the same pipeline designed 
to measure the linewidth at 50% of mean flux. Top; New data 
acquired with the Byrd Green Bank Telescope. Bottom: New Data 
acquired with the Arecibo Telescope. 

Nangay linewidth and a satisfactory linewidth either 
from the Cornell archive or new as reported here. The 
difference between new/Cornell and Nangay widths is 
Wnancay >= 2-6 ± 0.4 with r.m.s. scat- 



< w 



new / Cornell 

ter 10.8 km s~^ after rejection of 7 cases with excursions 
in excess of 4(t. The comparison is shown graphically in 
Figure 19. The r.m.s. scatter is at a level that, in com- 
parison with the discussion surrounding Fig. 18, implies 
a characteristic uncertainty with the Nancay widths of 
8 km s^i. 

The offset of 2.6 km s~^ seen in Fig. 19 is small but 
statistically significant. The comparison shown here is 
with widths corrected for line broadening. If the compar- 
ison is made on the directly observed Wmso lincwidths 
then the difference is —0.1 km s^^. The oiTset arises 
through the broadening corrections discussed in the next 
section. The Nangay data tends to need larger correc- 
tions and our recipe may be slightly excessive for this 
sample. However, the problem is sufficiently small that 
it will not affect the measurement of distances. 



Fig. 19. — The average of our new linewidths and linewidths 
derived from the Cornell archive data compared with linewidths 
derived from data from the Nangay archive. All data axe analyzed 
with the same pipeline designed to measure the linewidth at 50% 
of mean flux. 
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Fig. 20. — The average of our new linewidths and linewidths 
derived from the Cornell and Nangay archive data compared with 
linewidths derived from data from the Parkes archive. All data are 
analyzed with the same pipeline designed to measure the linewidth 
at 50% of mean flux. 



At the most southerly latitudes, neutral hydrogen ob- 
servations require use of the Parkes Telescope and results 
are becoming available on their archive"''. An important 
contribution has come from HIPASS, the HI Parkes All- 
Sky Survey (Koribalski et al. 2004). A comparison with 
material drawn from this database is summarized in the 
histogram of Figure 20. The difference in linewidths mea- 
sured off Parkes spectra with those measured off spectra 
from the Cornell or Nangay archives or from own new 
observations for 205 galaxies is < W^^^^^ - W^^^^es >= 
0.7 ± 0.5 with r.m.s. scatter 7.3 km s~^. The excellent 

® http: / /www.atnf.csiro.au/research/multibeam 
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very low scatter can be attributed in part to the fact 
that the galaxies in the comparison tend to be nearby 
and easily detected in HI. 

7. LINEWIDTH ADJUSTMENTS 

Several systematics affect linewidths. Two that are 
well understood are a slight relativistic broadening and 
broadening because of finite spectral resolution. SHGK 
discuss these matters at length and we adopt a simplified 
version of their solution. We adjust linewidths with the 
equation: 

Wm50 oA_.\ ^^-^ 



W 



m50 



2AvX 



where cz is the heliocentric velocity of the galaxy, Av 
is the spectral resolution after smoothing, and A is de- 
termined empirically. SHGK, with a closely equivalent 
formula, give a convoluted recipe for A. Their complex 
description may be appropriate in their case because of 
a coupling between signal-to-noise and their peak mea- 
surement, hence their linewidth measure. With our pa- 
rameter based on mean flux there is less systematic de- 
pendence on signal-to-noise. From tests on profiles with 
successively increased smoothing, we find broadening is 
statistically described by Eq. 1 if A = 0.25. This correc- 
tion is close to those advocated by Bottinelli et al. (1990) 
and Verheijen & Sancisi (2001). 

The measured profile linewidth, whether it is our VFmSO 
or another, is only a parameter of observational conve- 
nience and it is desirable to translate it into something 
more physically meaningful. Rectification to edge-on ori- 
entation is standard (division by sin i where i is the incli- 
nation from face-on). It is also common, but less secure, 
to adjust the observed linewidth to correspond statisti- 
cally with twice the maximum rotation velocity, Vmax- 
The adjustment is based upon samples with both global 
profiles and detailed rotation curves. TuUy & Fouquc 
(1985) investigated the matter and provided a descrip- 
tion that accounts for the effects of broadening by turbu- 
lent motions that transitions from a linear to quadratic 
correction as the unbroadened profile transitions from 
roughly boxcar in giant galaxies to gaussian in dwarfs. 
Their formula is: 



Wit = W! + Wl,[l - 2e-(^^/^-^)' 



(2) 



where the subscript £ stands for the observed linewidth 
measure, Wt,e is the turbulent broadening for that ob- 
served measure, and Wc,e characterizes the transition 
from boxcar to gaussian intrinsic profiles. In the case 
We = W20, TuUy & Fouquc (1985) recommended Wt = 
38 km s"^ and iVc = 120 km s^^. 

More recently, the problem has been studied in de- 
tail by Verheijen & Sancisi (2001). They compared 
global profiles with detailed rotation curve informa- 
tion for galaxies observed with the Westerbork Synthe- 
sis Radio Telescope. They determined that the TuUy- 
Fouque value taken for Wt was too large: that to get 
< Wn — 2Vmax sin i >~ with the transformation of 
W20 to Wr of Eq. 2 requires Wt = 22 km s'^ 

The transformations can be expected to be slightly 
different in detail with the new linewidth parameter 
Wm50- Comparisons have been made with 35 galaxies 



in the Ursa Major Cluster with rotation curves deter- 
mined from observations with the Westerbork Synthesis 
Radio Telescope and Vmax values reported by Verheijen 
(2001). As anticipated by Verheijen & Sancisi (2001), 
the effect of measuring the linewidth at a higher flux 
level above the basehne requires reduction of Wc^e and, 
especially, Wtj. The optimal flt results in the correla- 
tion seen in Figure 21. To differentiate from parame- 
ter variations discussed in earlier publications we deflne 
W^^ = W^ and flnd a best fit for the parameters in 

Eq. 2 with VFc,m50 = 100 km s^^ and Wi,rnba = 9 km s~^. 
With these parameters, observed Wrn:->() linewidths are 
transformed into W^x linewidths that agree with 2Vmax 
with an r.m.s. scatter of 12 km s~^ after deprojection 
(r.m.s. scatter 10 km in the line-of-sight) . This scat- 
ter is comparable to the Wm50 measurement accuracy. 
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Fig. 21. — Comparison between the global profile parameter 
and twice the maximum rotation velocity determined from 
spatially resolved rotation curves. 

A detailed study of the relationship between observed 
linewidths and the intrinsic kinematics of galaxies was 
carried out by Singhal (2008). Such a study is particu- 
larly important if the interest is to understand the physi- 
cal basis for the relationship between galaxy rotation and 
the light distribution. The slope of the correlation can 
be affected by the details of measurements and adjust- 
ments. For the practical matter of measuring distances 
the greater importance is to be consistent. 

8. THE ALL DIGITAL HI CATALOG IN EDD 

The All Digital HI catalog is accessed by selecting the 
'next' button on the EDD home page. It can be selected 
alone or in tandem with any of the other catalogs and ei- 
ther all or any fraction of the elements within the catalog 
can be selected. The tabular portion of the catalog is dis- 
played with the 'select' button and can be exported with 
the 'download' button. Upon entering the tabular dis- 
play, one can navigate to graphical displays of HI profiles 
by selecting on the common name of a galaxy (selecting 
on the PCC name in this, and any of the other catalogs, 
brings up a digital sky survey image of the galaxy). An 
example of what will be found is shown in Figure 22. 
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The galaxy seen in this case is PGC 19996 = ESQ 491- 
015. The image of the galaxy is drawn from the LED A 
website and displayed with a field of 10 arcmin, roughly 
the beam size of the GBT, hence a scale reasonable for 
an inspection for contamination from any near neigh- 
bors. The left profile was acquired by the authors with 
observations using GBT. The right profile is based on ob- 
servations with the Green Bank 140-foot telescope and 
was obtained from the Cornell archive. The two profiles 
are shown after treatment by the same analysis pipeline. 
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Fig. 22. — Example of a graphical display accessed from the All 
Digital HI catalog. 



The results of the analysis are carried to Table 4, the 
main catalog,® which includes the following information. 
The parameters given in columns 3-6 are averaged over 
multiple observations. Those in columns 7-18 are for an 
individual observation and the columns repeat if there 
are multiple observations. The first few lines of the table 
are given in the print version of this article. 

1. Principal Galaxies Catalog (PGC) name from the 
Lyon Extragalactic Database (LEDA). 

2. Common name (click on name to view profiles). 

3. Weighted average heliocentric velocity from all ac- 
ceptable profiles (km s~^). Weights are based on the 
inverse square of assigned errors. 

4. Weighted average linewidth approximating twice 
the maximum rotation velocity before projection from 
all acceptable profiles, Wmx (km s~^). 

5. Uncertainty attributed to the linewidth, the inverse 
square root of the sum of individual weights (km s~^). 

6. Number of acceptable profiles (errors less than or 
equal to 20 km s~^). 

7. Source of observation. Source codes are identified 
in Table 3 for sources incorporated at the time of publi- 
cation. 

8. Telescope and receiver. See Table 2 for more infor- 
mation. 

9. Heliocentric velocity, the midpoint between the low 
and high velocities that define Wmso (km s~^). 

10. WmbQ- Hnewidth at 50% of the mean flux per chan- 
nel where the mean flux is calculated within the 90% 
window, the range of velocities excluding 5% of the inte- 
grated flux at each end of the profile (km s^^). 

11. Linewidth corrected for relativistic and instrumen- 
tal broadening, W^gg = - 2AvX where A = 0.25 
and Av is the product of the values in columns 16 and 
17. 

12. Linewidth adjusted to statistically equal twice the 
maximum rotation velocity, before deprojection, derived 
from spatially resolved rotation curves (km s""'^). Statis- 
tically, Wmx ~ '^VmaxSini. This parameter is only con- 
sidered meaningful if ew < 20 km s~^. 

13. Uncertainty in the linewidth (km s"^). Uncer- 
tainties less than or equal to 20 km s^^ are consid- 
ered adequate for the purpose of determining a distance 
through the luminosity-linewidth correlation. An initial 
assignment of uncertainty, ew , is based on signal-to-noise 
(S/N): ew = Skm s'^ if S/N > 17; ew = 21.6-0.8S/N 
if 17 > S/N > 2; ew = 70 - 25S/N if S/N < 2. If 
the spectral resolution after smoothing is greater than 
this assignment then the error is increased to match the 
smoothed resolution. The uncertainty may have been 
modified by manual intervention to either increase it to 
above 20 km if the profile is too poor to be used for 
a distance measure or to decrease it to equal or below 
20 km if the profile is considered adequate for this 
purpose. The error value ew = 100 km s~^ is used to 
signal cases of confusion and the error value ew — 500 
km s~^ identifies a dubious/null detection. 

14. Signal-to-noise {S/N). The signal is the mean fiux 
per channel within the velocity range of the 90% window. 

^ Table 4 is provided with the on line version of the article in 
the Astronomical Journal or a continuously updated version can be 
downloaded from EDD at http:/ /edd. ifa.hawaii.edu upon selecting 
the All Digital HI catalog. 
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The noise is calculated over 100 channels on each side of 
the signal outside the velocity range of the 100% window. 

15. The flux is the signal integrated over all channels 
within the 100% window (Jy km s"^). No attempt has 
been made to account for flux lost due to the finite beam 
size. 

16. Channel resolution (km s^^). 

17. An integer N indicates averaging over N spectral 
channels in the profile that is displayed. 

18. Fm5o is the flux level at 50% of the mean, the level 
at which the measurement of W,n5o is made. 

The most recent status of the full Table 4 catalog is 
made available with the electronic version of this article. 

9. SUMMARY 

The purposes of the All Digital _H7 catalog in EDD are 
threefold: first, to make available the r(^sults of new ob- 
servations made of galaxies in the 21cm HI line; second, 
to make it easy to compare results with other observa- 
tions and link to other information about the targets; 
third, to present a reanalysis of all archival data avail- 
able in digital form to ensure that consistent linewidth 
information is available for essentially all galaxies that 
have been observed in HI. 

Our preferred linewidth parameter is Wm5o, the profile 
width at 50% of the mean flux within the velocity win- 
dow containing 90% of the total flux. This parameter is 
a variant of one of those introduced by the Cornell group 
(Springob et al. 2005). The availability within EDD of 
several very large catalogs of HI information facilitates 
comparisons and provides a way of culhng bad data. It 
is satisfying to see the tight correlations between alter- 
native profile descriptors. It is to be appreciated that 
various alternative profile descriptors have merit but it 
is important for the measurement of distances to main- 
tain consistency. After accounting for zero point off- 
sets, r.m.s. scatter between alternatives is at the level 
of 10 km s^^. Based on comparisons with detailed ro- 
tation curve information, a statistical transformation is 
proposed that takes the observed global linewidths to an 
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approximation of the maximum rotation velocity Vmax- 
Presently the catalog All Digital HI contains 15,411 
profiles providing information on 13,423 galaxies. Some 
57% of the profiles originate from the Cornell database 
(8740), 21% originate from the Nangay database (3225), 
7% come from Arecibo Legacy Fast Alfa survey (1047), 
another 7% were extracted from the Parkes archive (997) , 
1% are an Effelsberg contribution independent of the 
Cornell database (176), and 8% result from new observa- 
tions by our collaboration (1225). Currently profiles for 
10,580 galaxies are deemed acceptable. In 1,330 cases 
there are at least two acceptable profiles and in 81 cases 
there are three acceptable profiles. Inter comparisons be- 
tween sources suggest that the characteristic accuracy of 
an individual acceptable profile width is 7 km s~^. 



New observations across the entire sky have been made 
possible by access to three fine radio telescopes. We 
made early observations with the refurbished Arecibo 
Telescope and expect to add fresh material coming from 
the wide field multi-beam survey. At the Green Bank 
Telescope our ongoing project Cosmic Flows has been 
awarded the status of a Large Program. Observations of 
the deep southern sky began in 2009 with the Parkes 
Telescope in Australia. Equally important to us has 
been access to archival material from the Cornell Dig- 
ital HI Archive, the Nangay Radio Telescope HI profiles 
of Galaxies database, and the Australia Telescope on- 
line archive. Although electronic archives are a great 
innovation, the low-tech information gathered in the Pre 
Digital HI catalog retains great value and we thank 
Cyrus Hall for his role in assembling that material. We 
have made extensive use of NED, the NASA/IPAC Ex- 
tragalactic Database operated by the Jet Propulsion 
Laboratory, California Institute of Technology, and the 
HyperLeda database hosted at the Universite Lyon 1. 
Web access to the All Digital HI catalog is found at 
http://edd.ifa.hawaii.edu. 
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